CDS

Accession Number TCMCG017C21942
gbkey CDS
Protein Id OMO80090.1
Location complement(join(57355..57522,57613..58515))
GeneID InterPro:IPR002922
Organism Corchorus olitorius
locus_tag COLO4_24255

Protein

Length 356aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA215141, BioSample:SAMN03160584
db_source AWUE01018463.1
Definition Thiazole biosynthetic enzyme Thi4 family [Corchorus olitorius]
Locus_tag COLO4_24255

EGGNOG-MAPPER Annotation

COG_category H
Description Involved in biosynthesis of the thiamine precursor thiazole. Catalyzes the conversion of NAD and glycine to adenosine diphosphate 5-(2-hydroxyethyl)-4-methylthiazole-2-carboxylic acid (ADT), an adenylated thiazole intermediate. The reaction includes an iron-dependent sulfide transfer from a conserved cysteine residue of the protein to a thiazole intermediate. The enzyme can only undergo a single turnover, which suggests it is a suicide enzyme. May have additional roles in adaptation to various stress conditions and in DNA damage tolerance
KEGG_TC -
KEGG_Module -
KEGG_Reaction R10685        [VIEW IN KEGG]
KEGG_rclass RC00033        [VIEW IN KEGG]
RC03253        [VIEW IN KEGG]
RC03254        [VIEW IN KEGG]
BRITE ko00000        [VIEW IN KEGG]
ko00001        [VIEW IN KEGG]
KEGG_ko ko:K03146        [VIEW IN KEGG]
EC -
KEGG_Pathway ko00730        [VIEW IN KEGG]
ko01100        [VIEW IN KEGG]
map00730        [VIEW IN KEGG]
map01100        [VIEW IN KEGG]
GOs -

Sequence

CDS:  
ATGGCAGCCATGGCAACAGCCTTCACCTCATTGTCTTCAACCCCCAAATCAGCTTTCTTAGACCAGAAGTCATCTTTCCATGGCACCCCAATCGCTTCCCGTTTCACCCCAATCAAATCCTCATCACAAAACTCCACCATTTCCATGTCCTTGAACACTCCTCCTTACGACTTGAACTCCTTTAAATTCCAACCCATTAAAGAATCCTACGTCTCTCGTGAAATGACCCGCCGTTACATGATGGACATGATCACTTACGCCGACACCGACGTCATCATCGTCGGCGCCGGCTCCGCCGGTCTTTCTTGCGCTTACGAAATTAGCAAGAACCCCAACATCCGTGTCGCCATAATCGAACAATCAGTTAGCCCTGGCGGCGGCGCGTGGCTCGGCGGCCAACTGTTTTCCGCCATGGTTGTCCGCAAACCAGCTCACAGATTCCTCGACGAGCTCGGCATCCAATACGACGAACAAGAAAACTACGTCGTGATCAAACACGCCGCTTTGTTCACATCAACAATAATGAGCAAGCTTTTGGCCAGGCCGAACGTGAAATTGTTCAATGCCGTGGCGGCTGAGGATTTGATAGTGAAAGAAAACAGAGTTGCCGGAGTTGTGACGAACTGGGCTTTGGTATCTATGAACCATGACACCCAATCTTGCATGGACCCTAATGTGATGGAGTCTAAAGTCGTAGTGAGTTCTTGTGGACATGATGGACCCTTTGGAGCCACTGGAGTGAAGAGATTGAAGAGCATTGGGATGATTGACAGTGTTCCTGGAATGAAGGCACTGGACATGAATACTGCAGAGGATGCGATTGTGAGGCTAACCAGGGAGATTGTGCCTGGAATGATTGTTACAGGAATGGAAGTTGCAGAGATTGATGGAGCCCCAAGAATGGGTCCAACATTTGGGGCAATGATGATATCAGGGCAGAAAGCAGCACATTTGGCCTTGAAGGCATTGGGGCAGCCTAATGAGATAGATGGAACCTTGAGTGAAGCTGGAAGAATACAGCCAGAGTTTGTTCTTGCTTCTGCAGAGACTGAAGATACTGTGGATGCTTGA
Protein:  
MAAMATAFTSLSSTPKSAFLDQKSSFHGTPIASRFTPIKSSSQNSTISMSLNTPPYDLNSFKFQPIKESYVSREMTRRYMMDMITYADTDVIIVGAGSAGLSCAYEISKNPNIRVAIIEQSVSPGGGAWLGGQLFSAMVVRKPAHRFLDELGIQYDEQENYVVIKHAALFTSTIMSKLLARPNVKLFNAVAAEDLIVKENRVAGVVTNWALVSMNHDTQSCMDPNVMESKVVVSSCGHDGPFGATGVKRLKSIGMIDSVPGMKALDMNTAEDAIVRLTREIVPGMIVTGMEVAEIDGAPRMGPTFGAMMISGQKAAHLALKALGQPNEIDGTLSEAGRIQPEFVLASAETEDTVDA